Using n-Grams for Syndromic Surveillance in a Turkish Emergency Department Without English Translation: A Feasibility Study
Authors
Abstract
INTRODUCTION: Syndromic surveillance is designed for early detection of disease outbreaks. An important data source for syndromic surveillance is free-text chief complaints (CCs), which are generally recorded in the local language. For automated syndromic surveillance, CCs must be classified into predefined syndromic categories. The n-gram classifier is created by using text fragments to measure associations between CCs and a syndromic grouping of ICD codes.

OBJECTIVES: The objective was to create a Turkish n-gram CC classifier for the respiratory syndrome and then compare daily volumes between the n-gram CC classifier and a respiratory ICD-10 code grouping on a test set of data.

METHODS: The design was a feasibility study based on retrospective cohort data. The setting was a university hospital emergency department (ED) in Turkey. Included were all ED visits in the 2002 database of this hospital. Two of the authors created, by consensus, a respiratory grouping of International Classification of Diseases, 10th Revision (ICD-10-CM) codes, chosen to be similar to a standard respiratory (RESP) grouping of ICD codes created by the Electronic Surveillance System for Early Notification of Community-based Epidemics (ESSENCE), a project of the Centers for Disease Control and Prevention. An n-gram method adapted from AT&T Labs' technologies was applied to the first 10 months of data as a training set to create a Turkish CC RESP classifier. The classifier was then tested on the subsequent 2 months of visits to generate a time series graph and to determine the correlation between the daily volumes measured by the CC classifier and those measured by the RESP ICD-10 grouping.

RESULTS: The Turkish ED database contained 30,157 visits. The correlation (R²) between n-gram and ICD-10 daily volumes for the test set was 0.78.

CONCLUSION: The n-gram method automatically created a CC RESP classifier of the Turkish CCs that performed similarly to the ICD-10 RESP grouping. The n-gram technique has the advantages of systematic, consistent, and rapid deployment as well as language independence.
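The abstract does not spell out the AT&T Labs algorithm, so the following is only a minimal sketch of the general idea, under stated assumptions: chief complaints are split into character trigrams, each trigram is weighted by the fraction of training visits containing it that carried a respiratory ICD code, and a new complaint is scored by the mean weight of its trigrams. The function names (`char_ngrams`, `train`, `score`, `r_squared`), the trigram length, the scoring rule, and the toy Turkish complaints are all hypothetical and are not taken from the study. The last function computes the squared Pearson correlation that would be used to compare daily visit counts from the two methods.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Overlapping character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower().strip()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(records, n=3):
    """Weight each n-gram by the share of training visits containing it
    that were in the syndrome's ICD grouping (assumed weighting scheme).

    records: iterable of (chief_complaint, in_syndrome) pairs.
    """
    total, positive = Counter(), Counter()
    for complaint, in_syndrome in records:
        for gram in set(char_ngrams(complaint, n)):
            total[gram] += 1
            if in_syndrome:
                positive[gram] += 1
    return {gram: positive[gram] / total[gram] for gram in total}


def score(complaint, weights, n=3):
    """Mean weight of the complaint's n-grams; unseen n-grams count as 0."""
    grams = char_ngrams(complaint, n)
    if not grams:
        return 0.0
    return sum(weights.get(g, 0.0) for g in grams) / len(grams)


def r_squared(xs, ys):
    """Squared Pearson correlation of two equal-length series.
    Raises ZeroDivisionError if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)


# Invented toy training data (not from the study): complaint text and
# whether the visit's ICD code fell in the respiratory grouping.
toy_records = [
    ("öksürük", True),            # cough
    ("nefes darlığı", True),      # shortness of breath
    ("öksürük balgam", True),     # cough with sputum
    ("karın ağrısı", False),      # abdominal pain
    ("baş ağrısı", False),        # headache
    ("ayak burkulması", False),   # ankle sprain
]
weights = train(toy_records)

print(score("şiddetli öksürük", weights))  # severe cough
print(score("ayak ağrısı", weights))       # foot pain
```

In a full pipeline, complaints whose score exceeds a chosen threshold would be counted per day, and `r_squared` would be applied to that daily series against the daily counts from the ICD-10 grouping. Because the sketch operates on raw characters, it needs no translation or language-specific tokenizer, which is the property the abstract highlights.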
Similar resources
Syndromic Surveillance for Influenza in the Emergency Department–A Systematic Review
The science of surveillance is rapidly evolving due to changes in public health information and preparedness as national security issues, new information technologies and health reform. As the Emergency Department has become a much more utilized venue for acute care, it has also become a more attractive data source for disease surveillance. In recent years, influenza surveillance from the Emerg...
Automated Syndromic Classification of Chief Complaint Records
Syndromic surveillance, a medical surveillance approach that bins data into broadly defined syndrome groups, has drawn increasing interest in recent years for the early detection of disease outbreaks for both public health and bioterrorism defense. Emergency department chief complaint records are an attractive data source for syndromic surveillance owing to their timeliness and ready availabili...
A Contrastive Study of Theme in English and Azerbaijani Turkish Fictional Texts
Thematisation is one of the troublesome areas both for translation purposes from or into English and also for learning EFL. The main reason for the problem lies in the fact that usually different languages structure thematisation in different ways. Therefore, the present research is an attempt to investigate contrastively: experiential (topical), interpersonal and textual themes in a sample of A...
Using Syndromic Surveillance to Track E-cigarette Related Emergency Department Visits
Introduction The North Dakota Department of Health (NDDoH) investigated the feasibility of using syndromic surveillance (SyS) data to identify health care visits due to electronic cigarette (e-cigarette) use. E-cigarettes have been associated with injuries and fatalities in all age groups, including young children attracted to the colorful liquid nicotine carriage packaging [1]. Previously, poi...
Feasibility of Using Clinical and Non-Clinical Data Sources in the Influenza Syndromic Surveillance System: Applying a Correlation Analysis Approach
Background and Objectives: Syndromic surveillance systems are used for early detection of outbreaks. The purpose of this study was to determine the feasibility of clinical and non-clinical data sources used in influenza syndromic surveillance in Zanjan. Methods: In this time series study, clinical and non-clinical data related to influenza-like illness (ILI) as a potential data source of synd...
Journal title:
Volume 6, Issue
Pages -
Publication date: 2013